IN THE CLAIMS 

Please amend the claims as follows. 

For the Examiner's convenience, a list of all claims is included below. 

1. (Currently Amended) A machine-implemented method comprising: 
extracting portions from speech segments, the portions surrounding a segment
boundary within a phoneme;

identifying time samples from the portions;

constructing a matrix W containing first data corresponding to the time samples
from the portions surrounding the segment boundary within the phoneme and second data
corresponding to the portions; and deriving feature vectors that represent the portions in a
vector space by decomposing the matrix W containing the first data corresponding to the
time samples from the portions surrounding the segment boundary within the phoneme
and the second data corresponding to the portions, such that at least phase information of
the portions is preserved in the feature vectors; and

determining a distance between the feature vectors in the vector space. 

2. (Canceled). 

3. (Previously Presented) The machine-implemented method of claim 1, wherein 
decomposing the matrix W comprises extracting global boundary-centric features from 
the portions. 

4. (Previously Presented) The machine-implemented method of claim 1, wherein
the speech segments each include the segment boundary within the phoneme. 

5. (Original) The machine-implemented method of claim 4, wherein the speech 
segments each include at least one diphone. 

6. (Original) The machine-implemented method of claim 5, wherein the portions 
include at least one pitch period. 

7. (Original) The machine-implemented method of claim 6, wherein decomposing 
the matrix W comprises performing a pitch synchronous singular value analysis on the 
pitch periods of the time-domain segments. 

8. (Previously Presented) The machine-implemented method of claim 6, wherein
the matrix W is a 2KM x N matrix represented by

$W = U \Sigma V^T$

where K is the number of pitch periods near the segment boundary extracted from each
segment, N is the maximum number of samples among the pitch periods, M is the number
of segments in a voice table having a segment boundary within the phoneme, U is the
2KM x R left singular matrix with row vectors $u_i$ ($1 \le i \le 2KM$), $\Sigma$ is the R x R diagonal
matrix of singular values $s_1 \ge s_2 \ge \dots \ge s_R > 0$, V is the N x R right singular matrix with
row vectors $v_j$ ($1 \le j \le N$), $R \ll 2KM$, and T denotes matrix transposition, wherein
decomposing the matrix W comprises performing a singular value decomposition of W.
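
For illustration, the decomposition recited above can be sketched as follows, assuming NumPy and a random stand-in for W. The function name `truncated_svd` and the toy sizes K, M, N and R are hypothetical and are not taken from the claims.

```python
import numpy as np

def truncated_svd(W, R):
    """Return U (rows x R), s (R singular values) and V (N x R) with W ~ U diag(s) V^T."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)  # s comes back in descending order
    return U[:, :R], s[:R], Vt[:R, :].T

# Hypothetical toy sizes: 2KM = 12 pitch-period rows, N = 8 samples per row.
K, M, N, R = 2, 3, 8, 4
W = np.random.default_rng(0).standard_normal((2 * K * M, N))
U, s, V = truncated_svd(W, R)
```

Each row of U then corresponds to one pitch period, and the retained singular values in s are ordered from largest to smallest.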

9. (Original) The machine-implemented method of claim 8, wherein the pitch
periods are zero padded to N samples. 
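
As a minimal sketch of the zero padding, assuming NumPy: the helper `build_matrix` is a hypothetical name, and padding on the right-hand side is an assumption, since the claim only states that the pitch periods are zero padded to N samples.

```python
import numpy as np

def build_matrix(pitch_periods):
    """Stack pitch periods as rows, zero padding each one on the right to N samples."""
    N = max(len(p) for p in pitch_periods)  # N = length of the longest pitch period
    W = np.zeros((len(pitch_periods), N))
    for i, p in enumerate(pitch_periods):
        W[i, :len(p)] = p
    return W

W = build_matrix([[1.0, 2.0, 3.0], [4.0, 5.0]])
```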

10. (Original) The machine-implemented method of claim 9, wherein a feature vector
$\bar{u}_i$ is calculated as

$\bar{u}_i = u_i \Sigma$

where $u_i$ is a row vector associated with a pitch period i, and $\Sigma$ is the singular diagonal
matrix.

11. (Original) The machine-implemented method of claim 10, wherein the distance 
between two feature vectors is determined by a metric comprising the cosine of the angle 
between the two feature vectors. 

12. (Original) The machine-implemented method of claim 11, wherein the metric
comprises a closeness measure, C, between two feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is
calculated as

$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$

for any $1 \le k, l \le 2KM$.
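
A minimal sketch of the closeness measure, assuming NumPy; the function name `closeness` and the example vectors are hypothetical. Multiplying a row vector by the diagonal matrix of singular values is written here as an element-wise product with the vector s.

```python
import numpy as np

def closeness(u_k, u_l, s):
    """Cosine of the angle between the feature vectors u_k*Sigma and u_l*Sigma."""
    a = np.asarray(u_k) * s  # row vector times diag(s) scales each component
    b = np.asarray(u_l) * s
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

s = np.array([2.0, 1.0])
same = closeness([1.0, 1.0], [1.0, 1.0], s)
orthogonal = closeness([1.0, 0.0], [0.0, 1.0], s)
```

Identical row vectors give a closeness of one, and row vectors that remain orthogonal after scaling give zero.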



13. (Original) The machine-implemented method of claim 12, wherein a difference
$d(S_1, S_2)$ between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = d_0(p_1, q_1) = 1 - C(\bar{u}_{p_1}, \bar{u}_{q_1})$

where $d_0$ is the distance between pitch periods $p_1$ and $q_1$, $p_1$ is the last pitch period of $S_1$,
$q_1$ is the first pitch period of $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$,
and $\bar{u}_{q_1}$ is a feature vector associated with pitch period $q_1$.
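
The difference recited above can be sketched as follows, assuming NumPy; `closeness` and `segment_difference` are hypothetical names and the vectors are made-up examples.

```python
import numpy as np

def closeness(u_k, u_l, s):
    """Cosine of the angle between u_k*Sigma and u_l*Sigma, with Sigma = diag(s)."""
    a, b = np.asarray(u_k) * s, np.asarray(u_l) * s
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def segment_difference(u_p1, u_q1, s):
    """d(S1, S2) = d0(p1, q1) = 1 - C for the last period of S1 and the first of S2."""
    return 1.0 - closeness(u_p1, u_q1, s)

s = np.array([3.0, 1.0])
d_same = segment_difference([1.0, 2.0], [1.0, 2.0], s)
d_opposite = segment_difference([1.0, 2.0], [-1.0, -2.0], s)
```

Because C is a cosine, the difference ranges from zero for identical feature vectors to two for opposite ones.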



14. (Original) The machine-implemented method of claim 13, wherein the 
calculation for the difference between two segments in the voice table, $S_1$ and $S_2$, is
expanded to include a plurality of pitch periods from each segment. 



15. (Original) The machine-implemented method of claim 13, wherein the difference 
between two segments in the voice table, $S_1$ and $S_2$, is associated with a discontinuity
between $S_1$ and $S_2$.

16. (Original) The machine-implemented method of claim 12, wherein a difference
$d(S_1, S_2)$ between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = \left| d_0(p_1, q_1) - \frac{d_0(p_1, \bar{p}_1) + d_0(q_1, \bar{q}_1)}{2} \right| = \left| \frac{C(\bar{u}_{p_1}, \bar{u}_{\bar{p}_1}) + C(\bar{u}_{q_1}, \bar{u}_{\bar{q}_1})}{2} - C(\bar{u}_{p_1}, \bar{u}_{q_1}) \right|$

where $d_0$ is the distance between pitch periods, $p_1$ is the last pitch period of $S_1$, $\bar{p}_1$ is the
first pitch period of a segment contiguous to $S_1$, $q_1$ is the first pitch period of $S_2$, $\bar{q}_1$ is
the last pitch period of a segment contiguous to $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with
pitch period $p_1$, $\bar{u}_{q_1}$ is a feature vector associated with pitch period $q_1$, $\bar{u}_{\bar{p}_1}$ is a feature
vector associated with pitch period $\bar{p}_1$, and $\bar{u}_{\bar{q}_1}$ is a feature vector associated with pitch
period $\bar{q}_1$.
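
A minimal sketch of this difference, assuming NumPy and taking $d_0 = 1 - C$ as in claim 13; the names `d0` and `difference` and the random row vectors are hypothetical.

```python
import numpy as np

def d0(u_a, u_b, s):
    """Distance 1 - C between two pitch periods, C being the cosine of u_a*Sigma and u_b*Sigma."""
    a, b = np.asarray(u_a) * s, np.asarray(u_b) * s
    return 1.0 - float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def difference(u_p1, u_p1bar, u_q1, u_q1bar, s):
    """|d0(p1,q1) - (d0(p1,p1bar) + d0(q1,q1bar)) / 2| for the boundary between S1 and S2."""
    return abs(d0(u_p1, u_q1, s) - 0.5 * (d0(u_p1, u_p1bar, s) + d0(u_q1, u_q1bar, s)))

rng = np.random.default_rng(1)
s = np.array([3.0, 2.0, 1.0])
u_p1, u_p1bar, u_q1, u_q1bar = rng.standard_normal((4, 3))  # made-up row vectors
d = difference(u_p1, u_p1bar, u_q1, u_q1bar, s)
```

Substituting $d_0 = 1 - C$ into the first form cancels the constant terms, which is why the two expressions in the claim agree.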

17. (Previously Presented) The machine-implemented method of claim 1, further 
comprising associating the distance between the feature vectors with speech segments in 
a voice table. 

18. (Original) The machine-implemented method of claim 17, further comprising: 
selecting speech segments from the voice table based on the distance between the
feature vectors.

19. (Original) The machine-implemented method of claim 5, wherein the portions 
include centered pitch periods. 

20. (Previously Presented) The machine-implemented method of claim 19, wherein
the matrix W is a (2(K-1)+1)M x N matrix represented by

$W = U \Sigma V^T$

where K-1 is the number of centered pitch periods near the segment boundary extracted
from each segment, N is the maximum number of samples among the centered pitch
periods, M is the number of segments in a voice table having a segment boundary within
the phoneme, U is the (2(K-1)+1)M x R left singular matrix with row vectors
$u_i$ ($1 \le i \le (2(K-1)+1)M$), $\Sigma$ is the R x R diagonal matrix of singular values
$s_1 \ge s_2 \ge \dots \ge s_R > 0$, V is the N x R right singular matrix with row vectors $v_j$
($1 \le j \le N$), $R \ll (2(K-1)+1)M$, and T denotes matrix transposition, wherein decomposing the matrix W
comprises performing a singular value decomposition of W.

21. (Original) The machine-implemented method of claim 20, wherein the centered
pitch periods are symmetrically zero padded to N samples. 
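
A minimal sketch of symmetric zero padding, assuming NumPy; `pad_centered` is a hypothetical name, and placing the spare zero on the right when the amount of padding is odd is an assumption not stated in the claim.

```python
import numpy as np

def pad_centered(p, N):
    """Zero pad a centered pitch period to N samples, splitting the zeros between both ends."""
    p = np.asarray(p, dtype=float)
    left = (N - len(p)) // 2  # any odd leftover zero goes on the right (assumption)
    out = np.zeros(N)
    out[left:left + len(p)] = p
    return out

padded = pad_centered([1.0, 2.0, 3.0], 7)
```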

22. (Original) The machine-implemented method of claim 21, wherein a feature
vector $\bar{u}_i$ is calculated as

$\bar{u}_i = u_i \Sigma$

where $u_i$ is a row vector associated with a centered pitch period i, and $\Sigma$ is the singular
diagonal matrix.

23. (Original) The machine-implemented method of claim 22, wherein the distance
between two feature vectors is determined by a metric comprising a closeness measure,
C, between two feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$

for any $1 \le k, l \le (2(K-1)+1)M$.

24. (Original) The machine-implemented method of claim 23, wherein a difference
$d(S_1, S_2)$ between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = C(\bar{u}_{\pi_{-1}}, \bar{u}_{\delta_0}) + C(\bar{u}_{\delta_0}, \bar{u}_{\sigma_1}) - C(\bar{u}_{\pi_{-1}}, \bar{u}_{\pi_0}) - C(\bar{u}_{\sigma_0}, \bar{u}_{\sigma_1})$

where $\bar{u}_{\pi_{-1}}$ is a feature vector associated with a centered pitch period $\pi_{-1}$, $\bar{u}_{\delta_0}$ is a
feature vector associated with a centered pitch period $\delta_0$, $\bar{u}_{\sigma_1}$ is a feature vector
associated with a centered pitch period $\sigma_1$, $\bar{u}_{\pi_0}$ is a feature vector associated with a
centered pitch period $\pi_0$, and $\bar{u}_{\sigma_0}$ is a feature vector associated with a centered pitch
period $\sigma_0$.
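
A minimal sketch of this four-term difference, assuming NumPy; the function names, the argument names and the random row vectors are hypothetical.

```python
import numpy as np

def closeness(u_a, u_b, s):
    """Cosine of the angle between u_a*Sigma and u_b*Sigma, with Sigma = diag(s)."""
    a, b = np.asarray(u_a) * s, np.asarray(u_b) * s
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def centered_difference(u_pi_m1, u_delta0, u_sigma1, u_pi0, u_sigma0, s):
    """C(pi_-1, delta_0) + C(delta_0, sigma_1) - C(pi_-1, pi_0) - C(sigma_0, sigma_1)."""
    return (closeness(u_pi_m1, u_delta0, s) + closeness(u_delta0, u_sigma1, s)
            - closeness(u_pi_m1, u_pi0, s) - closeness(u_sigma0, u_sigma1, s))

rng = np.random.default_rng(2)
s = np.array([3.0, 2.0, 1.0])
u_pi_m1, u_mid, u_sigma1 = rng.standard_normal((3, 3))  # made-up row vectors
# Use the same vector for delta_0, pi_0 and sigma_0.
d = centered_difference(u_pi_m1, u_mid, u_sigma1, u_mid, u_mid, s)
```

When the feature vectors for $\delta_0$, $\pi_0$ and $\sigma_0$ coincide, the four terms cancel and the difference is zero.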

25. (Currently Amended) A machine-readable medium having instructions to cause a 
machine to perform a machine-implemented method comprising: 

extracting portions from speech segments that surround a segment boundary 
within a phoneme; 

identifying time samples from the portions; 

constructing a matrix W containing first data corresponding to the time samples
from the portions surrounding the segment boundary within the phoneme and second data
corresponding to the portions; and deriving feature vectors that represent the portions in a
vector space by decomposing the matrix W containing the first data corresponding to the
time samples from the portions surrounding the segment boundary within the phoneme
and the second data corresponding to the portions, such that at least phase information of
the portions is preserved in the feature vectors; and

determining a distance between the feature vectors in the vector space. 

26. (Canceled). 

27. (Previously Presented) The machine-readable medium of claim 25, wherein 
decomposing the matrix W comprises extracting global boundary-centric features from 
the portions. 

28. (Previously Presented) The machine-readable medium of claim 25, wherein the 
speech segments each include the segment boundary within the phoneme. 

29. (Original) The machine-readable medium of claim 28, wherein the speech 
segments each include at least one diphone. 

30. (Original) The machine-readable medium of claim 29, wherein the portions 
include at least one pitch period. 

31. (Original) The machine-readable medium of claim 30, wherein decomposing the
matrix W comprises performing a pitch synchronous singular value analysis on the pitch 
periods of the time-domain segments. 

32. (Previously Presented) The machine-readable medium of claim 30, wherein the 
matrix W is a 2KM x N matrix represented by 

$W = U \Sigma V^T$

where K is the number of pitch periods near the segment boundary extracted from each
segment, N is the maximum number of samples among the pitch periods, M is the number
of segments in a voice table having a segment boundary within the phoneme, U is the
2KM x R left singular matrix with row vectors $u_i$ ($1 \le i \le 2KM$), $\Sigma$ is the R x R diagonal
matrix of singular values $s_1 \ge s_2 \ge \dots \ge s_R > 0$, V is the N x R right singular matrix with
row vectors $v_j$ ($1 \le j \le N$), $R \ll 2KM$, and T denotes matrix transposition, wherein
decomposing the matrix W comprises performing a singular value decomposition of W. 

33. (Original) The machine-readable medium of claim 32, wherein the pitch periods 
are zero padded to N samples.

34. (Original) The machine-readable medium of claim 33, wherein a feature vector
$\bar{u}_i$ is calculated as

$\bar{u}_i = u_i \Sigma$

where $u_i$ is a row vector associated with a pitch period i, and $\Sigma$ is the singular diagonal
matrix.

35. (Original) The machine-readable medium of claim 34, wherein the distance
between two feature vectors is determined by a metric comprising the cosine of the angle 
between the two feature vectors. 



36. (Original) The machine-readable medium of claim 35, wherein the metric 
comprises a closeness measure, C, between two feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is
calculated as

$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$

for any $1 \le k, l \le 2KM$.



37. (Original) The machine-readable medium of claim 36, wherein a difference 
$d(S_1, S_2)$ between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = d_0(p_1, q_1) = 1 - C(\bar{u}_{p_1}, \bar{u}_{q_1})$

where $d_0$ is the distance between pitch periods $p_1$ and $q_1$, $p_1$ is the last pitch period of $S_1$,
$q_1$ is the first pitch period of $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$,
and $\bar{u}_{q_1}$ is a feature vector associated with pitch period $q_1$.



38. (Original) The machine-readable medium of claim 37, wherein the calculation for 
the difference between two segments in the voice table, $S_1$ and $S_2$, is expanded to include
a plurality of pitch periods from each segment. 

39. (Original) The machine-readable medium of claim 37, wherein the difference
between two segments in the voice table, $S_1$ and $S_2$, is associated with a discontinuity
between $S_1$ and $S_2$.

40. (Original) The machine-readable medium of claim 36, wherein a difference
$d(S_1, S_2)$ between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = \left| d_0(p_1, q_1) - \frac{d_0(p_1, \bar{p}_1) + d_0(q_1, \bar{q}_1)}{2} \right| = \left| \frac{C(\bar{u}_{p_1}, \bar{u}_{\bar{p}_1}) + C(\bar{u}_{q_1}, \bar{u}_{\bar{q}_1})}{2} - C(\bar{u}_{p_1}, \bar{u}_{q_1}) \right|$

where $d_0$ is the distance between pitch periods, $p_1$ is the last pitch period of $S_1$, $\bar{p}_1$ is the
first pitch period of a segment contiguous to $S_1$, $q_1$ is the first pitch period of $S_2$, $\bar{q}_1$ is
the last pitch period of a segment contiguous to $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with
pitch period $p_1$, $\bar{u}_{q_1}$ is a feature vector associated with pitch period $q_1$, $\bar{u}_{\bar{p}_1}$ is a feature
vector associated with pitch period $\bar{p}_1$, and $\bar{u}_{\bar{q}_1}$ is a feature vector associated with pitch
period $\bar{q}_1$.

41. (Previously Presented) The machine-readable medium of claim 25, wherein the 
method further comprises associating the distance between the feature vectors with 
speech segments in a voice table. 

42. (Original) The machine-readable medium of claim 41, wherein the method 
further comprises: 

selecting speech segments from the voice table based on the distance between the 
feature vectors. 

43. (Original) The machine-readable medium of claim 29, wherein the portions
include centered pitch periods. 

44. (Previously Presented) The machine-readable medium of claim 43, wherein the 
matrix W is a (2(K-1)+1)M x N matrix represented by

$W = U \Sigma V^T$

where K-1 is the number of centered pitch periods near the segment boundary extracted
from each segment, N is the maximum number of samples among the centered pitch
periods, M is the number of segments in a voice table having a segment boundary within
the phoneme, U is the (2(K-1)+1)M x R left singular matrix with row vectors
$u_i$ ($1 \le i \le (2(K-1)+1)M$), $\Sigma$ is the R x R diagonal matrix of singular values
$s_1 \ge s_2 \ge \dots \ge s_R > 0$, V is the N x R right singular matrix with row vectors $v_j$
($1 \le j \le N$), $R \ll (2(K-1)+1)M$, and T denotes matrix transposition, wherein decomposing the matrix W
comprises performing a singular value decomposition of W. 

45. (Original) The machine-readable medium of claim 44, wherein the centered pitch 
periods are symmetrically zero padded to N samples. 

46. (Original) The machine-readable medium of claim 45, wherein a feature vector
$\bar{u}_i$ is calculated as

$\bar{u}_i = u_i \Sigma$

where $u_i$ is a row vector associated with a centered pitch period i, and $\Sigma$ is the singular
diagonal matrix.

47. (Original) The machine-readable medium of claim 46, wherein the distance
between two feature vectors is determined by a metric comprising a closeness measure,
C, between two feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$

for any $1 \le k, l \le (2(K-1)+1)M$.

48. (Original) The machine-readable medium of claim 47, wherein a difference 
$d(S_1, S_2)$ between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = C(\bar{u}_{\pi_{-1}}, \bar{u}_{\delta_0}) + C(\bar{u}_{\delta_0}, \bar{u}_{\sigma_1}) - C(\bar{u}_{\pi_{-1}}, \bar{u}_{\pi_0}) - C(\bar{u}_{\sigma_0}, \bar{u}_{\sigma_1})$

where $\bar{u}_{\pi_{-1}}$ is a feature vector associated with a centered pitch period $\pi_{-1}$, $\bar{u}_{\delta_0}$ is a
feature vector associated with a centered pitch period $\delta_0$, $\bar{u}_{\sigma_1}$ is a feature vector
associated with a centered pitch period $\sigma_1$, $\bar{u}_{\pi_0}$ is a feature vector associated with a
centered pitch period $\pi_0$, and $\bar{u}_{\sigma_0}$ is a feature vector associated with a centered pitch
period $\sigma_0$.

49. (Currently Amended) An apparatus comprising: 

means for extracting portions from speech segments, the portions surrounding a 
segment boundary within a phoneme; 

means for identifying time samples from the portions; 

means for constructing a matrix W containing first data corresponding to the time 
samples from the portions surrounding the segment boundary within the phoneme and 
second data corresponding to the portions; and means for deriving feature vectors that
represent the portions in a vector space by decomposing the matrix W containing the first
data corresponding to the time samples from the portions surrounding the segment
boundary within the phoneme and the second data corresponding to the portions, such
that at least phase information of the portions is preserved in the feature vectors; and

means for determining a distance between the feature vectors in the vector space. 

50. (Canceled). 

51. (Previously Presented) The apparatus of claim 49, wherein the means for 
decomposing the matrix W comprises means for extracting global boundary-centric 
features from the portions. 

52. (Previously Presented) The apparatus of claim 49, wherein the speech segments 
each include the segment boundary within the phoneme. 

53. (Original) The apparatus of claim 52, wherein the speech segments each include 
at least one diphone. 

54. (Original) The apparatus of claim 53, wherein the portions include at least one 
pitch period. 

55. (Original) The apparatus of claim 54, wherein the means for decomposing the
matrix W comprises means for performing a pitch synchronous singular value analysis on 
the pitch periods of the time-domain segments. 

56. (Previously Presented) The apparatus of claim 54, wherein the matrix W is a
2KM x N matrix represented by

$W = U \Sigma V^T$

where K is the number of pitch periods near the segment boundary extracted from each
segment, N is the maximum number of samples among the pitch periods, M is the number
of segments in a voice table having a segment boundary within the phoneme, U is the
2KM x R left singular matrix with row vectors $u_i$ ($1 \le i \le 2KM$), $\Sigma$ is the R x R diagonal
matrix of singular values $s_1 \ge s_2 \ge \dots \ge s_R > 0$, V is the N x R right singular matrix with
row vectors $v_j$ ($1 \le j \le N$), $R \ll 2KM$, and T denotes matrix transposition, wherein
decomposing the matrix W comprises performing a singular value decomposition of W.

57. (Original) The apparatus of claim 56, wherein the pitch periods are zero padded 
to N samples. 

58. (Original) The apparatus of claim 57, wherein a feature vector $\bar{u}_i$ is calculated as

$\bar{u}_i = u_i \Sigma$

where $u_i$ is a row vector associated with a pitch period i, and $\Sigma$ is the singular diagonal
matrix.



Appl. No. 10/693,227 




AttyDkt. 4860P3128 



59. (Original) The apparatus of claim 58, wherein the distance between two feature 
vectors is determined by a metric comprising the cosine of the angle between the two 
feature vectors. 



60. (Original) The apparatus of claim 59, wherein the metric comprises a closeness 
measure, C, between two feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$

for any $1 \le k, l \le 2KM$.



61. (Original) The apparatus of claim 60, wherein a difference $d(S_1, S_2)$ between two
segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = d_0(p_1, q_1) = 1 - C(\bar{u}_{p_1}, \bar{u}_{q_1})$

where $d_0$ is the distance between pitch periods $p_1$ and $q_1$, $p_1$ is the last pitch period of $S_1$,
$q_1$ is the first pitch period of $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$,
and $\bar{u}_{q_1}$ is a feature vector associated with pitch period $q_1$.



62. (Original) The apparatus of claim 61, wherein the calculation for the difference 
between two segments in the voice table, $S_1$ and $S_2$, is expanded to include a plurality of
pitch periods from each segment. 

63. (Original) The apparatus of claim 61, wherein the difference between two
segments in the voice table, $S_1$ and $S_2$, is associated with a discontinuity between $S_1$ and
$S_2$.



64. (Original) The apparatus of claim 60, wherein a difference $d(S_1, S_2)$ between two
segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = \left| d_0(p_1, q_1) - \frac{d_0(p_1, \bar{p}_1) + d_0(q_1, \bar{q}_1)}{2} \right| = \left| \frac{C(\bar{u}_{p_1}, \bar{u}_{\bar{p}_1}) + C(\bar{u}_{q_1}, \bar{u}_{\bar{q}_1})}{2} - C(\bar{u}_{p_1}, \bar{u}_{q_1}) \right|$

where $d_0$ is the distance between pitch periods, $p_1$ is the last pitch period of $S_1$, $\bar{p}_1$ is the
first pitch period of a segment contiguous to $S_1$, $q_1$ is the first pitch period of $S_2$, $\bar{q}_1$ is
the last pitch period of a segment contiguous to $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with
pitch period $p_1$, $\bar{u}_{q_1}$ is a feature vector associated with pitch period $q_1$, $\bar{u}_{\bar{p}_1}$ is a feature
vector associated with pitch period $\bar{p}_1$, and $\bar{u}_{\bar{q}_1}$ is a feature vector associated with pitch
period $\bar{q}_1$.

65. (Previously Presented) The apparatus of claim 49, further comprising means for 
associating the distance between the feature vectors with speech segments in a voice 
table. 

66. (Original) The apparatus of claim 65, further comprising: 

means for selecting speech segments from the voice table based on the distance 
between the feature vectors. 

67. (Original) The apparatus of claim 53, wherein the portions include centered pitch
periods. 

68. (Previously Presented) The apparatus of claim 67, wherein the matrix W is a
(2(K-1)+1)M x N matrix represented by

$W = U \Sigma V^T$

where K-1 is the number of centered pitch periods near the segment boundary extracted
from each segment, N is the maximum number of samples among the centered pitch
periods, M is the number of segments in a voice table having a segment boundary within
the phoneme, U is the (2(K-1)+1)M x R left singular matrix with row vectors
$u_i$ ($1 \le i \le (2(K-1)+1)M$), $\Sigma$ is the R x R diagonal matrix of singular values
$s_1 \ge s_2 \ge \dots \ge s_R > 0$, V is the N x R right singular matrix with row vectors $v_j$
($1 \le j \le N$), $R \ll (2(K-1)+1)M$, and T denotes matrix transposition, wherein decomposing the matrix W
comprises performing a singular value decomposition of W.

69. (Original) The apparatus of claim 68, wherein the centered pitch periods are 
symmetrically zero padded to N samples.

70. (Original) The apparatus of claim 69, wherein a feature vector $\bar{u}_i$ is calculated as

$\bar{u}_i = u_i \Sigma$

where $u_i$ is a row vector associated with a centered pitch period i, and $\Sigma$ is the singular
diagonal matrix.

71. (Original) The apparatus of claim 70, wherein the distance between two feature
vectors is determined by a metric comprising a closeness measure, C, between two
feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$

for any $1 \le k, l \le (2(K-1)+1)M$.

72. (Original) The apparatus of claim 71, wherein a difference $d(S_1, S_2)$ between two
segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = C(\bar{u}_{\pi_{-1}}, \bar{u}_{\delta_0}) + C(\bar{u}_{\delta_0}, \bar{u}_{\sigma_1}) - C(\bar{u}_{\pi_{-1}}, \bar{u}_{\pi_0}) - C(\bar{u}_{\sigma_0}, \bar{u}_{\sigma_1})$

where $\bar{u}_{\pi_{-1}}$ is a feature vector associated with a centered pitch period $\pi_{-1}$, $\bar{u}_{\delta_0}$ is a
feature vector associated with a centered pitch period $\delta_0$, $\bar{u}_{\sigma_1}$ is a feature vector
associated with a centered pitch period $\sigma_1$, $\bar{u}_{\pi_0}$ is a feature vector associated with a
centered pitch period $\pi_0$, and $\bar{u}_{\sigma_0}$ is a feature vector associated with a centered pitch
period $\sigma_0$.

73. (Currently Amended) A system comprising: 

a processing unit coupled to a memory through a bus; and 
wherein the processing unit is configured, for a process, to extract portions from speech 
segments, the portions surrounding a segment boundary within a phoneme, identify time 
samples from the portions; construct a matrix W containing first data corresponding to the 
time samples from the portions surrounding the segment boundary within the phoneme 
and second data corresponding to the portions, and derive feature vectors that represent
the portions in a vector space by decomposing the matrix W containing the first data
corresponding to the time samples from the portions surrounding the segment boundary
within the phoneme and the second data corresponding to the portions, such that at least
phase information of the portions is preserved in the feature vectors; and determine a
distance between the feature vectors in the vector space.

74. (Canceled). 

75. (Previously Presented) The system of claim 73, wherein the process further 
causes the processing unit, when decomposing the matrix W, to extract global boundary- 
centric features from the portions. 

76. (Previously Presented) The system of claim 73, wherein the speech segments 
each include the segment boundary within the phoneme. 

77. (Original) The system of claim 76, wherein the speech segments each include at 
least one diphone. 

78. (Original) The system of claim 77, wherein the portions include at least one pitch 
period. 

79. (Original) The system of claim 78, wherein the process further causes the
processing unit, when decomposing the matrix W, to perform a pitch synchronous 
singular value analysis on the pitch periods of the time-domain segments. 

80. (Previously Presented) The system of claim 78, wherein the matrix W is a 2KM x
N matrix represented by

$W = U \Sigma V^T$

where K is the number of pitch periods near the segment boundary extracted from each
segment, N is the maximum number of samples among the pitch periods, M is the number
of segments in a voice table having a segment boundary within the phoneme, U is the
2KM x R left singular matrix with row vectors $u_i$ ($1 \le i \le 2KM$), $\Sigma$ is the R x R diagonal
matrix of singular values $s_1 \ge s_2 \ge \dots \ge s_R > 0$, V is the N x R right singular matrix with
row vectors $v_j$ ($1 \le j \le N$), $R \ll 2KM$, and T denotes matrix transposition, wherein
decomposing the matrix W comprises performing a singular value decomposition of W.

81. (Original) The system of claim 80, wherein the pitch periods are zero padded to 
N samples. 

82. (Original) The system of claim 81, wherein a feature vector $\bar{u}_i$ is calculated as

$\bar{u}_i = u_i \Sigma$

where $u_i$ is a row vector associated with a pitch period i, and $\Sigma$ is the singular diagonal
matrix.

83. (Original) The system of claim 82, wherein the distance between two feature
vectors is determined by a metric comprising the cosine of the angle between the two 
feature vectors. 



84. (Original) The system of claim 83, wherein the metric comprises a closeness 
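measure, C, between two feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$

for any $1 \le k, l \le 2KM$.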



85. (Original) The system of claim 84, wherein a difference $d(S_1, S_2)$ between two
segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = d_0(p_1, q_1) = 1 - C(\bar{u}_{p_1}, \bar{u}_{q_1})$

where $d_0$ is the distance between pitch periods $p_1$ and $q_1$, $p_1$ is the last pitch period of $S_1$,
$q_1$ is the first pitch period of $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$,
and $\bar{u}_{q_1}$ is a feature vector associated with pitch period $q_1$.



86. (Original) The system of claim 85, wherein the calculation for the difference 
between two segments in the voice table, $S_1$ and $S_2$, is expanded to include a plurality of
pitch periods from each segment. 



87. (Original) The system of claim 85, wherein the difference between two segments 
in the voice table, $S_1$ and $S_2$, is associated with a discontinuity between $S_1$ and $S_2$.

88. (Original) The system of claim 84, wherein a difference $d(S_1, S_2)$ between two
segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = \left| d_0(p_1, q_1) - \frac{d_0(p_1, \bar{p}_1) + d_0(q_1, \bar{q}_1)}{2} \right| = \left| \frac{C(\bar{u}_{p_1}, \bar{u}_{\bar{p}_1}) + C(\bar{u}_{q_1}, \bar{u}_{\bar{q}_1})}{2} - C(\bar{u}_{p_1}, \bar{u}_{q_1}) \right|$

where $d_0$ is the distance between pitch periods, $p_1$ is the last pitch period of $S_1$, $\bar{p}_1$ is the
first pitch period of a segment contiguous to $S_1$, $q_1$ is the first pitch period of $S_2$, $\bar{q}_1$ is
the last pitch period of a segment contiguous to $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with
pitch period $p_1$, $\bar{u}_{q_1}$ is a feature vector associated with pitch period $q_1$, $\bar{u}_{\bar{p}_1}$ is a feature
vector associated with pitch period $\bar{p}_1$, and $\bar{u}_{\bar{q}_1}$ is a feature vector associated with pitch
period $\bar{q}_1$.

89. (Previously Presented) The system of claim 74, wherein the process further 
causes the processing unit to associate the distance between the feature vectors with 
speech segments in a voice table. 

90. (Original) The system of claim 89, wherein the process further causes the 
processing unit to select speech segments from the voice table based on the distance 
between the feature vectors. 

91. (Original) The system of claim 77, wherein the portions include centered pitch 
periods. 

92. (Previously Presented) The system of claim 91, wherein the matrix W is a
(2(K-1)+1)M x N matrix represented by

$W = U \Sigma V^T$

where K-1 is the number of centered pitch periods near the segment boundary extracted
from each segment, N is the maximum number of samples among the centered pitch
periods, M is the number of segments in a voice table having a segment boundary within
the phoneme, U is the (2(K-1)+1)M x R left singular matrix with row vectors
$u_i$ ($1 \le i \le (2(K-1)+1)M$), $\Sigma$ is the R x R diagonal matrix of singular values
$s_1 \ge s_2 \ge \dots \ge s_R > 0$, V is the N x R right singular matrix with row vectors $v_j$
($1 \le j \le N$), $R \ll (2(K-1)+1)M$, and T denotes matrix transposition, wherein decomposing the matrix W
comprises performing a singular value decomposition of W.

93. (Original) The system of claim 92, wherein the centered pitch periods are 
symmetrically zero padded to N samples. 

94. (Original) The system of claim 93, wherein a feature vector $\bar{u}_i$ is calculated as

$\bar{u}_i = u_i \Sigma$

where $u_i$ is a row vector associated with a centered pitch period i, and $\Sigma$ is the singular
diagonal matrix.

95. (Original) The system of claim 94, wherein the distance between two feature 
vectors is determined by a metric comprising a closeness measure, C, between two 
feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$

for any $1 \le k, l \le (2(K-1)+1)M$.



96. (Original) The system of claim 95, wherein a difference $d(S_1, S_2)$ between two
segments in the voice table, $S_1$ and $S_2$, is calculated as

$d(S_1, S_2) = C(\bar{u}_{\pi_{-1}}, \bar{u}_{\delta_0}) + C(\bar{u}_{\delta_0}, \bar{u}_{\sigma_1}) - C(\bar{u}_{\pi_{-1}}, \bar{u}_{\pi_0}) - C(\bar{u}_{\sigma_0}, \bar{u}_{\sigma_1})$

where $\bar{u}_{\pi_{-1}}$ is a feature vector associated with a centered pitch period $\pi_{-1}$, $\bar{u}_{\delta_0}$ is a
feature vector associated with a centered pitch period $\delta_0$, $\bar{u}_{\sigma_1}$ is a feature vector
associated with a centered pitch period $\sigma_1$, $\bar{u}_{\pi_0}$ is a feature vector associated with a
centered pitch period $\pi_0$, and $\bar{u}_{\sigma_0}$ is a feature vector associated with a centered pitch
period $\sigma_0$.

97. (Currently Amended) A machine-implemented method comprising: 
gathering time-domain samples from recorded speech segments, wherein the
time-domain samples include time samples of pitch periods surrounding a segment
boundary within a phoneme; 

constructing a matrix containing first data corresponding to the time samples of 
the pitch periods surrounding the segment boundary within the phoneme and second data 
corresponding to the pitch periods and deriving feature vectors that represent the time 
samples in a vector space by decomposing the matrix containing the first data 
corresponding to the time samples of the pitch periods surrounding the segment boundary 
within the phoneme and the second data corresponding to the pitch periods, such that at
least phase information of the time samples is preserved in the feature vectors;

determining a discontinuity between the segments, the discontinuity based on a 
distance between the features. 

98. (Canceled). 

99. (Previously Presented) The machine-implemented method of claim 97, wherein 
the features incorporate phase information of the pitch periods. 

100. (Canceled). 

101. (Currently Amended) A machine-readable medium having instructions to cause a 
machine to perform a machine-implemented method comprising: 

gathering time-domain samples from recorded speech segments, wherein the 
time-domain samples include time samples of pitch periods surrounding a segment 
boundary within a phoneme; 

constructing a matrix containing first data corresponding to the time samples of 
the pitch periods surrounding the segment boundary within the phoneme and second data 
corresponding to the pitch periods and deriving feature vectors that represent the time 
samples in a vector space by decomposing the matrix containing the first data 
corresponding to the time samples of the pitch periods surrounding the segment boundary 
within the phoneme and the second data corresponding to the pitch periods; [[such that at 
least phase information of the time samples is preserved in the feature vectors;]] 

determining a discontinuity between the segments, the discontinuity based on a 
distance between the features. 

102. (Canceled). 

103. (Previously Presented) The machine-readable medium of claim 101, wherein the 
features incorporate phase information of the pitch periods. 

104. (Canceled). 

105. (Currently Amended) An apparatus comprising: 

means for gathering time-domain samples from recorded speech segments, 
wherein the time-domain samples include time samples of pitch periods surrounding a 
segment boundary within a phoneme; 

means for constructing a matrix containing first data corresponding to the time 
samples of the pitch periods surrounding the segment boundary within the phoneme and 
second data corresponding to the pitch periods and deriving feature vectors that represent 
the time samples in a vector space by decomposing the matrix containing the first data 
corresponding to the time domain samples of the pitch periods surrounding the 
segment boundary within the phoneme and the second data corresponding to the pitch 
periods; [[such that at least phase information of the time samples is preserved in the 
feature vectors;]] 

means for determining a discontinuity between the segments, the discontinuity 
based on a distance between the features. 

106. (Canceled). 

107. (Previously Presented) The apparatus of claim 105, wherein the features 
incorporate phase information of the pitch periods. 

108. (Canceled). 

109. (Currently Amended) A system comprising: 

a processing unit coupled to a memory through a bus; and 
a process executed from the memory by the processing unit to cause the processing unit 
to gather time-domain samples from recorded speech segments, wherein the time-domain 
samples include time samples of pitch periods surrounding a segment boundary within a 
phoneme, constructing a matrix containing first data corresponding to the time-domain 
samples of the pitch periods surrounding the segment boundary within the phoneme and 
second data corresponding to the pitch periods and deriving feature vectors that represent 
the time samples in a vector space by decomposing the matrix containing the first data 
corresponding to the time domain samples of the pitch periods surrounding the segment 
boundary within the phoneme and the second data corresponding to the pitch periods; 
[[such that at least phase information of the time samples is preserved in the feature 
vectors;]] and determine a discontinuity between the segments, the discontinuity based on a 
distance between the features. 

110. (Canceled). 

111. (Previously Presented) The system of claim 109, wherein the features incorporate 
phase information of the pitch periods. 

112. (Canceled). 